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Decentralized Sequential Composite Hypothesis 
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Abstract 

This paper considers the sequential composite hypothesis test with multiple sensors. The sensors 
observe random samples in parallel and communicate with a fusion center, who makes the global 
decision based on the sensor inputs. On one hand, in the centralized scenario, where local samples are 
precisely transmitted to the fusion center, the generalized sequential likelihood ratio test (GSPRT) is 
shown to be asymptotically optimal in terms of the expected stopping time as error rates tend to zero. 

On the other hand, for systems with limited power and bandwidth resources, decentralized solutions 
that only send a summary of local samples (we particularly focus on a one-bit communication protocol) 
to the fusion center is of great importance. To this end, we first consider a decentralized scheme where 
sensors send their one-bit quantized statistics every fixed period of time to the fusion center. We show 
that such a uniform sampling and quantization scheme is strictly suboptimal and its suboptimality can 
be quantified by the KL divergence of the distributions of the quantized statistics under both hypotheses. 

We then propose a decentralized GSPRT based on level-triggered sampling. That is, each sensor runs 
its own GSPRT repeatedly and reports its local decision to the fusion center asynchronously. We show 
that this scheme is asymptotically optimal as the local thresholds and global thresholds grow large at 
different rates. Lastly, two particular models and their associated applications are studied to compare 
the centralized and decentralized approaches. Numerical results are provided to demonstrate that the 
proposed level-triggered sampling based decentralized scheme aligns closely with the centralized scheme 
with substantially lower communication overhead, and significantly outperforms the uniform sampling 
and quantization based decentralized scheme. 
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I. Introduction 

It is well known that the sequential hypothesis test generally requires a smaller expeeted sample 
size to aehieve the same level of error probabilities compared to its fixed-sample-size counterpart. 
For instance, for testing on different mean values of Gaussian samples, [1] showed that the 
optimal sequential procedure needs four times less samples on average than the Neyman-Pearson 
test. Following the seminal work [2] that proved the optimality of the sequential probability ratio 
test (SPRT) in the context of sequential test, a rich body of works has investigated its variants in 
various scenarios and applications. Among them, the composite hypothesis test is of significant 
interest. In particular, [3] generalized SPRT to 2-SPRT; the sequential composite hypothesis test 
was discussed by [4-7] for the exponential families; furthermore, [8,9] studied sequential test 
among multiple composite hypotheses. 

Using multiple sensors for hypothesis test constitutes another mainstream of the sequential 
inference paradigm, motivated by the potential wide application of wireless sensor technology. In 
general, multi-sensor signal processing can be divided into two categories. One features parallel 
structure (also known as the fully distributed scenario), that allows all sensors to communicate 
based upon a certain network topology and reach consensus by message-passing; the other 
features a hierarchical structure and requires a fusion center that makes the global decision 
by receiving information from distributed sensors. In this work, we consider the hierarchical 
type of systems where sensors play the role of information relay. In the ideal case, if the 
system is capable of precisely relaying the local samples from sensors to the fusion center 
whenever they become available, we are faced with a centralized multi-sensor hypothesis testing 
problem. However, the centralized setup amounts to instantaneous high-precision communication 
between sensors and the fusion center (i.e., samples quantized with large number of bits are 
transmitted at every sampling instant). In practice, many systems, especially wireless sensor 
networks, cannot afford such a demanding requirement, due to limited sensor batteries and 
channel bandwidth resources. Aiming at deceasing the communication overhead, many works 
proposed the decentralized schemes that allow sensors to transmit small number of bits at lower 
frequency. In particular, [10] described five (“case A” through “case E”) scenarios of decentralized 
sequential test depending on the availability of local sensor memory and feedback from the fusion 
center to sensors. There, the optimal algorithm was established via dynamic programming for 
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“case E” which assumed full local memory and feedbaek meehanism. However, in resouree- 
eonstrained sensor networks, it is not desirable for sensors to store large amount of data samples 
and for the fusion eenter to send feedbaek. Therefore, in this paper, we assume that sensors have 
limited loeal memory and no feedbaek information is available. 

As mentioned above, deereasing the eommunieation overhead ean be aehieved from two 
perspectives: First, sensors use less bits to represent the loeal statisties; seeond, the fusion center 
samples loeal statisties at a lower frequeney eompared to the sampling rate at sensors. On one 
hand, in many eases, the original sample/statistie is quantized into one-bit message, whieh is 
then transmitted to the fusion eenter. As such, [11,12] showed that the optimal quantizer for 
fixed-sample-size test eorresponds to the likelihood ratio test (LRT) on loeal samples. Then [13] 
demonstrated that the LRT is not neeessarily optimal for sequential deteetion under the Bayesian 
setting, due to the asymmetry of the Kullback-Leibler divergenee between the null and alternative 
hypotheses. [14-16] further investigated the stationary quantization sehemes under the Bayesian 
setting. One the other hand, in order to lower the eommunieation frequeney, all the above work 
ean be generalized to the case where quantization and transmission are performed every fixed 
period of time. These sehemes generally involve fixed-sample-size test at sensors and sequential 
test at the fusion eenter, whieh we refer to as the uniform sampling and quantization strategy. 
Alternatively, [17] proposed that each sensor runs a loeal sequential test and local decisions are 
eombined at the fusion eenter in a fixed-sample-size fashion. Furthermore, [18] proposed to run 
sequential tests at both sensors and the fusion eenter, amounting to an adaptive transmission 
triggered by local SPRTs, though no optimality analysis was provided there. To fill that void, 
[19] defined sueh a seheme as level-triggered sampling and proved its asymptotie optimality in 
both diserete and eontinuous time. However, [18-21] only eonsidered the simple hypothesis test, 
where the likelihood funetions ean be speeified under both hypotheses. 

In spite of its broad speetrum of applieations, the multi-sensor sequential eomposite hypothesis 
test remains to be investigated from both algorithmie and theoretieal perspeetives. Owing to the 
unknown parameters, the LR-based deeentralized algorithms using either uniform sampling or 
level-triggered sampling as mentioned above are no longer applieable. Hitherto, some existing 
works have addressed this problem in the fixed-sample-size setup. For example, [22] developed 
a binary quantizer by minimizing the worst-ease Cramer-Rao bound for multi-sensor estimation 
of an unknown parameter. Reeently, [23] proposed to quantize loeal samples (suffieient statisties) 
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by comparing them with a prescribed threshold; then, the fusion center performs the generalized 
likelihood ratio test by treating the binary messages from sensors as random samples. A similar 
scheme was established in [24] for a Rao test at the fusion center. Both [23,24] assumed that 
the unknown parameter is close to the parameter under the null hypothesis. In [25], a composite 
sequential change detection (a variant of sequential testing) based on discretization of parameter 
space was proposed. 

In this work, we propose two decentralized schemes for sequential composite hypothesis 
test. The first is a natural extension of the decentralized approach in [23], that employs the 
conventional uniform sampling and quantization mechanism, to its sequential counterpart. The 
second builds on level-triggered sampling and features asynchronous communication between 
sensors and the fusion center. Moreover, our analysis shows that the level-triggered sampling 
based scheme exhibits asymptotic optimality when the local and global thresholds grow large at 
different rates, whereas the uniform sampling scheme is strictly suboptimal. Using the asymptot¬ 
ically optimal centralized algorithm as a benchmark^ it is found that the proposed level-triggered 
sampling based scheme yields only slightly larger expected sample size, but with substantially 
lower communication overhead. 

The key contribution here is that we have applied the level-triggered sampling to the decen¬ 
tralized sequential composite hypothesis test and provided a rigorous analysis on its asymp¬ 
totic optimality. Though [26,27] have applied the level-triggered sampling to deal with multi- 
sensor/multi-agent sequential change detection problem with unknown parameters, no theoretical 
optimality analysis was provided there. The main challenge for analysis lies in characterizing 
the performance of the generalized sequential probability ratio test for generic families of 
distributions, which has not been fully understood. To that end, the recent work [28] provides 
the analytic tool that is instrumental to the analysis of the decentralized sequential composite test 
based on level-triggered sampling in this paper. Note that, in essence, [28] studied the single¬ 
sensor sequential composite test, whereas we consider the sequential composite test under the 
decentralized multi-sensor setup in this paper. 

The remainder of the paper is organized as follows. In Section II, we briefly formulate the 


'The performance of the decentralized scheme is supposed to be inferior to that of the centralized one because the fusion 
center has less information from the local sensors (i.e., a summary of local samples within a period of time, instead of the exact 
samples at every time instant). 
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Fig. 1. A hierarchical multi-sensor system consisting of distributed sensors and a fusion center. 


sequential eomposite hypothesis test under the multi-sensor setup. Then we discuss the central¬ 
ized generalized likelihood ratio test in Section III. In Section IV, we propose two decentralized 
testing schemes based on uniform sampling and level-triggered sampling respectively, together 
with their performance analysis. Then in Section V, specific models are studied and numerical 
results are given to further compare the decentralized schemes. Finally, Section VI concludes 
this paper. 


II. Problem Statement 

Suppose that L sensors observe samples yf, i = at each discrete time t, and 

communicate to a fusion center which makes the global decision based upon its received 
messages from sensors, as shown in Fig. 1. Assuming the existence of density functions, the 
observed samples are distributed according to h^{x) under the null hypothesis "Ho and fg{x) 
under the alternative hypothesis "Hi. We assume that 7 and 9 fall within the parameter sets F 
and 0 respectively. Given 7 and 9, the random samples under both hypotheses are independent 
over time and across the sensors. Under such a setup, we arrive at a composite null versus 
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composite alternative hypothesis testing problem: 

-Ho: yi ~ ^7 > 7 e r, 

■Hi : feix), 9 e 0, 

where £ = {1,...,L}. In general, and fe may belong to different families of distributions. 
The goal is to find the stopping time T that indicates the time to stop taking new samples and 
the deeision funetion 6 that deeides between "Ho and Tii, sueh that the expeeted sample size is 
minimized given the error probabilities are satisfied, i.e., 

inf IEa;T, X G r U 0 (2) 

subjeet to supP..y (^ = 1) < a, supPg (5 = 0) < f3, (3) 

7 9 

where Eg denotes expeetation taken with respeet to (w.r.t.) fg and E.^ w.r.t. h^. Note that (2)-(3) 
are in faet (possibly uneountably) many optimization problems (depending on the parameter 
spaees 0 and T) with the same eonstraints. Unfortunately, unlike the simple null versus simple 
alternative hypothesis ease, finding a unique optimal sequential test for these problems is infeasi¬ 
ble, even when a single-sensor or a eentralized setup is eonsidered. Therefore, the approaehes that 
possess asymptotic optimality become the focus of interest. In the following sections, we start by 
briefly introducing the generalized sequential probability ratio test (GSPRT) as an asymptotically 
optimal solution for the eentralized system; then, two deeentralized sehemes will be developed 
based on uniform sampling and level-triggered sampling respeetively. In partieular, we will show 
that the latter seheme is asymptotically optimal when certain eonditions are met. Here we first 
give the widely-adopted definition of asymptotie optimality [19,28]. 


e £, t = 1,2,... 
G £, f = 1,2,... 


( 1 ) 


Definition 1. Let T{a, (3) be the class of sequential tests with stopping time and decision function 
{T', (5'} that satisfy the type-1 and type-11 error probability constraints in (3). Then the sequential 
test {T,5} G T{a,(3) is said to be asymptotically optimal, as a, (3 ^ 0, if 


ExT 

1 < :-^- 

inf{T',5'}er(a,/3) ExT' 


1 + Oq,,/3(1), 


(4) 


or equivalently, ExT ~ inf{T', 5 '}er(a,/ 9 ) E^T'/or every x G T U 0. Here, x ~ ?/ denotes x/y —1 
as x,y ^ oo. 
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III. Centralized Generalized Sequential Probability Ratio Test 


In this section, we consider the centralized scenario, where local samples [yf] are made 


available at the fusion center in full precision. Note that the centralized multi-sensor test is not 
much different from the single-sensor version except that, at each time instant, multiple samples 
are observed instead of one. Since finding the optimal sequential composite hypothesis testing is 
impossible, the solutions with asymptotic optimality become the natural alternatives. In particular, 
the GSPRT is obtained by substituting the unknown parameter with its maximum likelihood 
estimate in the SPRT; alternatively, one can perform an SPRT using the marginal likelihood 
ratio by integrating out the unknown parameters when the priors on unknown parameters are 
available. In this paper, we avoid presuming priors on parameters and adopt the GSPRT. 

Due to the conditional independence for samples over time and across sensors, the global 
likelihood ratio function is evaluated as 





(5) 


Then the centralized GSPRT can be represented with the following stopping time 



( 6 ) 


and the decision function at the stopping instant 



1 if >A 

0 if < -B. 


(7) 


Here St is referred to as the generalized log-likelihood ratio (GLLR) of the samples up to 
time t, and A, B are prescribed constants such that the error probability constraints in (3) are 
satisfied. Practitioners can choose their values according to Proposition 1 given below which 
relates A, B to type-I and type-II error probabilities asymptotically. Before delving into the 
performance characterization of the centralized GSPRT (6)-(7), we recall the Kullback-Leibler 
(KL) divergence between two distributions and fg: 



( 8 ) 
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Assume that the following conditions/assumptions hold, 

Al) The distributions under the null and the alternative hypotheses are strietly separated, i.e., 
inf^ D {feWh^) > e and vale D {h^\\fo) > £ for some £ > 0. This eondition implies that the 
GLLR St takes different drifting direetions in expeetation under the null and the alternative 
hypotheses ; 

A2) D {feWh^) and D [h^Wfo) are twiee continuously differentiable w.r.t. 7 and 6*; 

A3) The parameter spaces T and 0 are compact sets; 

A4) Let 5'(7,6') = log/6i(F) — logh^(F). There exists rf > 1 ,xq such that for all 7 G T, 6* G 
0 , x > xo, we have 



P., (sup|Ve^(7,0)| >x ) 

(9) 


Veee / 


and 

Pe (sup|V.^^( 7 , 0 )| >x) 

(10) 


7er 


This condition imposes that the tail of the first-order derivative of the likelihood ratio w.r.t. 
ox 6 decays faster than any polynomial. 

According to [28], the performance of the GSPRT can be characterized asymptotically in closed 
form, which we quote here as a proposition. 


Proposition 1. [28, Theorem 2.2-2.3] For the composite hypothesis testing problem given by 
(1), the GSPRT that consists of stopping rule (6) and decision function (7) yields the following 
asymptotic performance 


suplogP.^((5T, = 1)- A, 

7 er 


E-y (Tc) ~ 


B 

mie&e D {hf\fe) L' 


sup log Pe = 0) ~ -5, 
0e0 


Ee (Tc) ~ 


A 

D {fe\\h-i) L' 


as A, B 00 . 


( 11 ) 

( 12 ) 


Proposition 1 indicates that the GSPRT, i.e., (6) and (7), is asymptotically optimal among the 
class of L-sensor centralized tests 7))^(a,/9) in the sense that 


Ex (Tc) ~ inf Ej. (T), x G T U 0, (13) 

{T,5}&TTA,0) 

as a = sup..yP^ ((5 tc = 1) ^ 0 and (3 = supgP^ ((5 tc = 0) —)■ 0 [28, Corollary 2.1]. However, 
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as mentioned in Seetion I, in spite of its asymptotie optimality, the centralized GSPRT yields 
substantial data transmission overhead between the sensors and the fusion center; therefore, 
it may become impractical when the communication resources are constrained. Moreover, the 
centralized scheme puts all computation burden at the fusion center. Hence, it is of great interest 
to consider the decentralized scheme where the computation is distributed among the sensors 
and the fusion center, with much lower communication overhead between the sensors and the 
fusion center. 

IV. Decentralized Sequential Composite Hypothesis Test 

In this section, we investigate the decentralized sequential composite hypothesis test, where 
the fusion center is only able to access a summary of local samples. In particular, each sensor 
transmits a one-bit message to the fusion center every Tq (deterministically or on average) 
samples. We first consider the conventional decentralized scheme based on the uniform sampling 
and one-bit quantization. That is, every sensor sends its one-bit quantized local statistic to the 
fusion center every fixed Tq samples. Then we propose a decentralized scheme based on level- 
triggered sampling (LTS), where the one-bit transmission is stochastically activated by the local 
statistic process at each sensor, and occurs every Tq samples on average. Interestingly, we show 
that such LTS-based decentralized scheme provably achieves the asymptotic optimality with 
much lower communication overhead compared with the centralized scheme. 

A. Decentralized GSPRT based on Uniform Sampling and Quantization 

The decentralized scheme based on uniform sampling and quantization is a natural extension 
of the decentralized fixed-sample-size composite test in [23] to its sequential counterpart. Denote 
the sufficient statistic from the jth to the fcth sample at sensor ^ as f (i/j,... ,yi). On 

one hand, at every sensor, the statistic is quantized into one-bit message by comparing it with 
a prescribed threshold A, i.e., 

,i(T„) A sign - a) , (14) 

Note that (14) corresponds to a stationary quantizer that does not change over time and is studied 
in decentralized estimation [24] and detection [23] problems due to its simplicity. On the other 
hand, the fusion center receives q^,i=l,...,L,SLS its own random samples every Tq interval. To 
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that end, the fusion eenter runs a GSPRT on the basis of the reeeived g^’s, which are Bernoulli 
random variables with different distributions under the null and alternative hypotheses [29]: 


T, ^ inf 


r . g A (^0 log (1 - PJ”) + logPe°) 

\ * supper (^0 log (1 - Py°) + r\ hgp'^°) 


(15) 


where (g^(To) = l) , X G { 7 , 6 ^}, and r^,rQ represent the number of received “+1” 

and “- 1 ” respectively, i.e., r* ^ Eti En:nTo<t “ Eti En:nTo<t 1 k=-i}- Upon 

stopping, Til is declared if Gj^ > A, and "Hq is declared if Gj^ < —B, i.e., 

Assuming that conditions A1-A4 listed in the preceding section are satisfied by the Bernoulli 
random samples g^, the decentralized GSPRT based on uniform sampling and quantized statistics 
can be characterized by invoking Proposition 1. That is, as A, B ^ oo, the type-I and type-II 
error probabilities admit 


suplogP^ = l) ~suplogPe ((5 t, = o) ~-5, (16) 

7 er 6»e© 

and the expected sample sizes under the null and alternative hypotheses admit the following 
asymptotic expressions, respectively: 


and 


(mf^c(p™||p5’”) /n)L' 

B 

(inf,Z}(p^°||pJ“)/To)L' 


(17) 

(18) 


It is well known that D \ |p^°) /Tq < D {fe\\h-y) [30], which leads to inf.^ D 11^!^°) /Tq < 
ini^ D {fg\\h:y); therefore, the decentralized GSPRT implemented by (14) and (15) yields sub- 
optimal performance, where the suboptimality is determined by the KL divergence between 
the distributions of quantized sufficient statistics under null and alternative hypotheses. The 
performance also depends on the choice of the quantization threshold A: 


• A can be chosen such that either inf.^ D | \p^°) or inf^ D (jjA° \ \p^°) is maximized. In gen¬ 
eral, these two terms cannot be optimized simultaneously. Therefore, a tradeoff is required 
between the expected sample sizes under the null and alternative hypotheses. 

• Given that typically the expected sample size under the alternative hypothesis is of inter¬ 
est, the optimal A, in general, depends on the unknown parameter {9,'y}. One possible 
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suboptimal solution is to find the optimal quantizer for the worst-ease seenario, i.e., 

A* = arg max min D ■ (19) 

A 9^'y 

Nonetheless, the performanee is expeeted to degrade when the aetual parameters deviate 
from the worst-ease seenario. 

B. Decentralized GSPRT based on Level-Triggering Sampling 

Next, we develop a level-triggered sampling (LTS) seheme for the deeentralized sequential 
eomposite test. Here, eaeh sensor runs its own loeal GSPRT and reports its loeal deeision to the 
fusion eenter repeatedly. And a global GSPRT is performed by the fusion eenter based on the 
reeeived loeal deeisions from all sensors until a eonfident deeision ean be made. As opposed to 
the uniform sampling seheme, the LTS-based deeentralized seheme features asynehronous one-bit 
eommunieation between loeal sensors and the fusion eenter. The idea of running SPRTs at both 
the sensors and the fusion eenter was first proposed by [18] for simple hypothesis test, and was 
further analyzed in [19,20]. In this work, we apply it to the sequential eomposite test. The essenee 
of level-triggered sampling is to adaptively update loeal statistie to the fusion eenter, i.e., transmit 
messages only when suffieient information is aeeumulated, whieh results in substantially lower 
eommunieation overhead and superior performanee eompared with the deeentralized seheme 
based on uniform sampling and finite-bit quantization. For the simple SPRT, level-triggered 
sampling is equivalent to Lebesgue sampling of loeal running LLR. However, sinee the LLR 
is not available in the eomposite ease, we obtain a different proeedure than that in [18-20]. 
Nevertheless, our analysis shows that, in the asymptotie regime, our proposed proeedure inherits 
the same optimality as for the simple test seenario. In the proposed LTS-GSPRT, eaeh sensor 
employes a sequential proeedure instead of a fixed-sample-size proeedure. As we show in the 
following subseetions, sueh a refinement greatly enhanees the performanee of deeentralized 
deteetion and leads to the asymptotie optimality. 

1) LTS-based Approximate GSPRT: Now we derive the LTS-based deeentralized sequential 
eomposite testing algorithm. First let us determine the eommunieation protoeol and one-bit 
message at eaeh sensor. Considering that sensors possess limited memory (i.e., seenario A in 
[10]), every time a loeal deeision is made and transmitted, the eorresponding sensor refreshes 
its memory and runs another GSPRT based on newly arriving samples (Thus the fusion eenter 
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receives i.i.d. information bits). Then the nth transmission time at sensor £ is a stopping time 
random variable recursively defined as 

ti. - inf {t : ^ {-b, a) | , n = 1, 2,..., to = 0, (20) 


with 

t t 

- sup ^ log fe(Vj) - sup log (y^), (21) 

and a, b are prefixed constants. Note that (20) is equivalent to a local GSPRT at sensor i, thus 
different {a, b} lead to different inter-communication period, or sampling frequency by the fusion 
center. Correspondingly, the one-bit message amounts to the local decision, i.e.. 


e A 

uz, = 


-fl, if > a , 

-1, if < -6 


( 22 ) 


Intuitively, (20)-(22) indicate that sensors run GSPRT repeatedly in parallel and their decisions are 
transmitted to the fusion center in an asynchronous fashion. Given the level-triggered sampling 
scheme at sensors, we proceed to define an approximation to the GLLR at the fusion center. 


L N} 


(23) 


£=1 n=l 


where Nf = max{n : < t}. The fusion center stops receiving messages at the stopping time 


and makes the decision 

^ f 1 if Vj > A, 

r A I Ip — ’ 

^Tp - < 

0 if < -B. 


(24) 


(25) 


In effect, as we will see later, (24) amounts to an approximation to the GSPRT at the fusion 
center based on the received one-bit messages (u^. The proposed decentralized sequential 
composite test procedure based on level-triggered sampling is summarized as Algorithm la-lb. 


2) A Closer Look at the LTS-based Approximate GSPRT: Next we discuss how Algorithm la- 
lb approximates the optimal procedure, i.e., GSPRT, at the fusion center. The optimal rule at 
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Algorithm la : Repeated GSPRT at Local Sensors 

1 : Initiali^tion: t 0, •(— 1, 0 

2 : while e {—b, a) do 
3: t t + 1 ^nd t^e new sample yf 

4 : Compute = Sjf aeeording to (21) 

5: end while 

6 : tg i — t 

7 : Send to the fusion center 

8 : Reset S'^ 0 and go to line 2. 


Algorithm Ih : Global GSPRT at Fusion Center 

1 : Initialization: V 0 

2 : while —B <V<Ado 

3 : Listen to the sensors and receive information bits, say, tq “+r’s and ri l”s 

4: V V + Via — r^b 

5: end while 

6 : HV>A then decide "Hi 
7: else decide Ho 


the fusion center is to compute the LLR of the local GSPRT decisions, i.e.. 


L Ni 


£=1 n=l 


iQg if = 1 

£/_ /IN A 1 ^6 “'n > 


and <( 7 ,^) = 


logy^ if< = -l, 


(26) 

(27) 


where w^( 7 , 9) is the LLR of the Bernoulli sample |/^, and and jSo are the type-I and type-II 
error probabilities respectively at the local sensor, i.e.. 


= Pt,(m„ = 1), 




(28) 


Note that Vt{'y,9) is again a function of the unknown parameters since the distribution of 
varies with 7 and 9. To that end, employing the GSPRT as that in ( 6 ) and (15), the original 
global stopping time is expressed as 

inf : inf sup 1 ^ 4 ( 7 , 9) ^ {—B, A) | . (29) 
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The global GSPRT involves solving the maximization in (29) whenever a new message is 
reeeived. However, unlike in the uniform sampling ease, solving this optimization problem is 
no easy task sinee the distribution of as a funetion of 6 and 7 is unelear. Aiming for a 
eomputationally feasible algorithm, we eontinue to simplify (29) in what follows. Using (26)- 
(27), 


L Nf 


inf sup Vt( 7 , 6) = inf sup V V log ^{ui=i} + log 


£=1 n=l 
L N‘ 


1 — a.. 


= 


Ul=-i} 


7 9 


L N 


inf sup EE -loga^l{„^=i} + \ogl3el{ui=-i} 


l=l n=l 


£=1 n=l 


EE( - sup log ^{ui=i} + sup log /3g l{ui=-i} ) , as a, 6 -)■ CX), (30) 


0e0 


Therefore, denoting a = sup^a^,(] = sup^/Sg, the global GLLR is approximately a simple 
random walk proeess 


L Ni 

EE( -logo l{ui=l} + log/3 1{ui=- 

1=1 n=l 



m 


1=1 n=l 


1 } - , 


(31) 


due to Proposition 1. The above expression implies that the stoehastie proeess V) as defined in 
(23) approximates the GLLR inf.^ supg 14 ( 7 , 0). 


C. Performance Analysis of the LTS-based Decentralized Scheme 

In this subsection, we show that the LTS-based decentralized scheme serves as a superior 
solution to the uniform-sampling-based scheme because it preserves the asymptotic optimality 
of the centralized scheme. This interesting property allows us to achieve the same centralized 
asymptotic performance, but consuming significantly lower communication resources. In par¬ 
ticular, the expected sample size under the null and alternative hypotheses are characterized 
asymptotically by the following theorem. 

Theorem 1. In the asymptotic regime where b,a ^ 00 and A/a, B/h ^ 00 , the expected sample 
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sizes of LTS-GSPRT admit the following asymptotic expressions 

B 


(Tp; 


+ o {B ), 


mie D {hf\fe) L 

Ee (Tp) ~ —— . , . + o (A). 

inf/y D (/i911 h'y) L 


(32) 

(33) 


Proof: See Appendix A. ■ 

Notably, as opposed to that of the uniform sampling seheme in (17)-(18), the expeeted sample 
sizes of the proposed LTS-based deeentralized seheme preserve the KL divergences between fg 
and as the denominators. In fact, (Tp) and Eg (Tp) increase with A and B at the same rate 
as that of the centralized GSPRT (cf. (12)). We next proceed to relate the type-I and type-II error 
probabilities of the LTS-based decentralized scheme to the global decision thresholds {—B,A} 
by the theorem below. 


Theorem 2. In the asymptotic regime where b,a ^ cxo, limsup a/fe < oo, limsup6/a < oo and 
A/a, B/h ^ oo, the type-1 and type-11 error probabilities of the LTS-GSPRT admit the following 
asymptotic expressions: 


suplogP^((5Tp = 1)- A, (34) 

7er 

sup log Pe (5 tp = O) ~ -B. (35) 

0e0 

Proof: See Appendix B. ■ 

Combining (32)-(35), we arrive to the following conclusion on the asymptotic optimality of 
the proposed LTS-based decentralized algorithm. 

Corollary 1. Let 7/ (a, jd) be the class of any L-sensor decentralized sequential tests, of which 
the type-1 and type-11 error probabilities are bounded by a and (d respectively. Then the proposed 
LTS-based GSPRT {Tp,(5Tp} is asymptotically optimal within this class, i.e.. 

Ex (Tp) ~ inf Ex (T), x G T U 0, (36) 

{T,5}er/(a,/3) 

as a ^ sup.^ ((5 tp = l) —)■ 0 and fd = sup^ ((5 tp = O) —)■ 0. 

Proof: Given the same error probabilities a = sup^^P.^ ((5 tc = 1) = sup..yP^ ((5 tp = l) and 
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f5 = sup^P^i (^Tc = 0) = supgPg (^Tp = O), the expected sample sizes of the centralized and 
LTS-based decentralized scheme Tp admit the following asymptotic performance, as a,P —)■ oo: 


E^Tc ~ E^Tp ~ 


E^Tc ~ EgiTp ~ 


-log/3 

inie D (h^Wfe) L' 

— log a 

inf^ D (/el 1^7) L 


(37) 

(38) 


These expressions suggest that the LTS-based decentralized scheme inherits the asymptotic 
performance of the centralized GSPRT. As a result, it is also safe to say that LTS-GSPRT 
is asymptotically optimal among the decentralized schemes that satisfy the same error rate 
constraints, since 


1 < 


E^Tp 


l'^l'{T,5}er/(a,/3) 1^3:T 


< 


ExTp 


inf 


{T,5}eT£{a,l3) 


E,pT 


E^-Tp 




E^T^ imf ,6^S‘T£(oi,l3) Ea^T 


1 + Oq:,/3(1). 


(39) 


The second inequality holds true by noting that no decentralized scheme can outperform the 
centralized one because less information is available at the fusion center. The last asymptotic 
relation is obtained by using (37)-(38) and the conclusion in Proposition 1, i.e., E^-Tc ~ 

illf{T,5}eri^(a,/3) Ea;T. ■ 


Recall that, for the simple null versus simple alternative hypothesis test, where the SPRT is 
optimal, the centralized and LTS-based decentralized SPRT (denoted as Tc and Tp respectively) 
provide the following asymptotic performance [20,21]: 


E^Tc ~ E^Tp ~ 


-log/3 

Dih,\\fe)L^ 


EqTc ~ EeTp ~ 


— log a 

D{f0\\h,)V 


(40) 

(41) 


where a = (5 = 1), /9 = Pg (5 = 0). Compared to the simple test where parameter values 

are given, the proposed sequential composite test requires larger expected sample sizes under 
both hypotheses (since the expected sample sizes are inversely proportional to inf.^gr D (feWh^), 
inig^Q D (h^Wfg) instead of D{fg\\h^), D{h^\\fg), as seen by comparing (37)-(38) and (40)- 
(41)). This is the price we pay for not knowing the exact parameters. 
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V. Numerical Results 


There are a wide range of applieations where the deeentralized sequential eomposite hypothesis 
test plays an important role. In this seetion, we apply the proposed eentralized and deeentralized 
sequential tests to two examples: one is to deteet the mean shift of Gaussian random samples; 
and the other involves speetrum sensing in eognitive radio systems. 


A. Mean Shift of Gaussian Random Samples 

Deteeting the mean shift of Gaussian random samples has many applieations. For example, 
suppose we intend to detect the presence of a unknown parameter 6 as soon as possible in the 
environment contaminated by white Gaussian noise. Here 6 could be the energy of an object that 
is monitored by a wireless sensor network or a multi-station radar system. The target parameter 
is assumed to be within a certain interval, i.e., 6 E [6*o, 6*i], 6*o > 0. Then we have an L-sensor 
hypothesis testing problem: 


"^0: yl = 4^ ^ e £, f = 1,2,... 

Hi-. yf = e + el o<eo<e<ei, eec, t = i,2,... 


(42) 


where ~ Af{0, cx^). Sensors are able to transmit one-bit every Tq sampling instants on average. 
For this model, both fe and are Gaussian probability density functions and 7 = 0. The 


sufficient statistic of the jth to /cth samples at sensor £ is their summation, denoted as 
gkj,t A Y4=jyi - First of all, we verify that the log likelihood ratio of yf, i.e.. 



(43) 


satisfies the conditions A1-A4. While conditions A2-A3 are easily verified, conditions A1 and 
A4 require the following check: 

• The KL divergence admits D (feWh^) = D (hflfe) = (6* — 7)^/(2(T^). By choosing 0 < 


c < ^, we have D{fg\\ho) = ^ > c and mig^<g<e^ D{ho\\fg) = ^ > e; 
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• For (9), let x > xq > , then we have 


( sup I V6)S'(6', 7)1 > x 

\eo<e<ei 

=P^ ( sup \yl — 9\> xa" 

\eo<e<ei 

=^7 (iVt - ^o| > xa^; vt > 


t D \ \ ^0 + , m (\..1 D I ^ „.e ^ ^0 + ^1 


+ P 7 Ivt - ^il > xa ] yt< 




P^ {yl > xa"^ + 6 * 0 ) + P-y {yl < -xa"^ + 61) 
xa^ + Oq — 'y\ f —xa^ + — 7 


a 


$ 


a 


(44) 


Note that $ (—x) ~ for large x, henee we ean always find a suffieiently large xq > 
^]^ 2 ° sueh that x"^ > \ logxl’^ , or equivalently, Py (sup 0 jj< 0 < 0 ^ | V 6 )S'( 6 ', 7) | > x) < 
for a; > a;o ,7 > 1. Similarly, we ean show that (10) holds as well. 


Therefore, Proposition 1 and Theorems 1-2 ean be applied to eharaeterize the asymptotie perfor- 
manee of the eentralized GSPRT and LTS-based GSPRT for the problem under eonsideration. 


To implement the eentralized GSPRT in (6)-(7), the global GLLR at the fusion eenter is 
eomputed as 

S^j = sup ^ Sy - L{k- j + 1)^^ /cT^ 


i=i 


( L 




\ 


i=\ 


/u^ 


with = S { x;[=i SY/ {k - j + 1) /L,eo,ei], and 


(45) 


S{x,9o,e,)^{ 


X, iixe[9o,9i], 
9i, if X > 9i, 

9o, if x < 6 * 0 - 


(46) 


Substituting St in (6)-(7) with Si eomputed by (45) gives the eentralized GSPRT (C-GSPRT). 

For the LTS-based GSPRT (LTS-GSPRT), note that the parameter MLE at sensor i based 
on the jth to kth samples is straightforwardly eomputed as 9^’^ = £ {k — j + 1), 9^, j. 
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which leads to the local GLLR statistic at sensor t. 


Sf 


/ 

I 

V 


{k- i + 1 ) 



2 

/ 



(47) 


Substituting (47) into (20) and (21), the LTS-GSPRT ean be implemented aeeording to Algorithm 
la-lb. 


To implement the uniform sampling based GSPRT (U-GSPRT), we quantize the suffieient 
statisties at the nth transmission period at loeal sensors by 

= sign - a) . (48) 


Given the threshold A, and the distribution of statistie 


c«Tb,£ 

‘^(n-l)To-rl 


rs_/ 


A/'( 0 ,(T^To) under Hq, 
Af{eTo,a^To) under Hi, 


we have the distribution of Bernoulli samples as 


(49) 


P*W=l)=P?W = l-'S‘(^^). ie{0,|9o,eil}. (50) 

Again we first verify that the log likelihood ratio of i.e., 

S49,7) = «'log^ + (l-,yiogif^ (51) 

satisfies conditions A1-A4. Specifically, A2-A3 is easy to verify, and we eheck A1 and A4 as 
follows: 


• Sinee p^° ^ for all Oq < 9 < 9i and 7 = 0, it is guaranteed that D {p^°\\pq°) and thus 
infe 79 {Po°\\P 0 °) are positive, and there exists an e > 0 sueh that 79 (pJ°||Po°) > ^ 
infe79 > e. 
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To verify (9) for S'„ ( 7 , 9), we have 


( sup |V0S'„(6>,7)| > x ) 
\eo<9<9i / 


=P^ I sup 

\9q<9<9i 

=P^ sup 

\ 6»o<6»<6»i 


=Pt, sup 

\^eo< 0 <ei pf{l-pf) 


1 — of dp^° 



> X 


> X 



€ - 


+ P^ ( sup 

*^eo<e<ei do 


> x; = 0 


=P7°lf 8p^0 T 'l+(l“P?)lf 8P^0 T 1’ 

sup0p<e<e^ -^/Pe°>^ ^ { supeQ<e<ei -g|-/(l-Pe°)>a: > 


Note that ^ = ^exp(-(A - 0T„f j(2a^T„)) < and 0 < pJJ < pf < 
whieh lead to 


(52) 

pS £ P? £ Pfc < 1 . 


ap? , T„ , V7i 1 J 9p? V% 1 

sup - 1 ^ Pa < ^-^ and sup ^r^/(l — Pfl ) < ^- 

6»o<0<01 y/^a Pg° 9o<9<9l 90 ^/^al—pg° 

Henee, by letting xq = max i k we have P^ (sup^ |V 0 S'„( 6 ', 7 )] > x) 

( PSq POi ) 

0 < all x > Xo ,7 > 1. Similarly, eondition (10) holds as well. 


As a result, the performance of U-GSPRT can be characterized asymptotically by (16)-(18). 

Next, we solve for the constrained MLE of the unknown parameter up to nth transmssion 
period: 

ro log (1 - Pe°{\)) + < logp0° (A) 

where Tq and r” represent the number of received “—1” and “+1” respectively among the first 
received n bits. By noting that the objective in (53) is a concave function of 0, we can invoke 
the optimality condition and find the MLE as ^A — ^j cr/\/Tjj, 6 ^ 0 , 6*1 j. 


On = argmax 
9>9o 


= argmax 
9>9o 
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Fig. 2. Expected samples versus false alarm probability a. 


In the simulation experiment, we set the algorithm parameters as follows. The noise variance 
is normalized as one, i.e, cr^ = 1. The parameter interval is 6* G [0.4,2]. The U-GSPRT 
is implemented in two settings, i.e., the inter-communication period Tq = 10 and Tq = 1 
respectively. The expected inter-communication period for the level-triggered sampling scheme 
is fixed as approximately ETq ~ 10 by adjusting the local thresholds {a,b}. In both cases, 
the binary quantizer in the minimax sense, i.e., the threshold that solves (19), is found to be 
A/To ^ 0.32. 

In Figs. 2-3, the performances of C-GSPRT, U-GSPRT and LTS-GSPRT are examined based 
on a two-sensor system. Specifically, Fig. 2 depicts the expected sample size under the alternative 
hypothesis (with 9 = 0.4) as a function of the false alarm probability, with the miss detection 
probability equal to /) ~ 10“"^. Fig. 3 depicts the expected sample size under the null hypothesis as 
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a function of the miss detection probability, with the false alarm probability equal to a ~ 10“'^. 
In these two figures, the black solid lines correspond to the following asymptotic formulas 
respectively (cf. (37)-(38) without the o(-) terms), 

Y j _ - log« _ - log g -log/^ _ -log/^ 

' DU'9\\ho)L e^L/2^ ° mUD{ho\\fe)L 011/2' 

Note that since the true parameter in the experiment is 6*o = 0.4, infg D{ho\\fo) = i7(/io||/e), 
the black-solid lines in Figs. 2-3 also correspond to the performance of SPRT for the simple 
null versus simple alternative test. As expected, both C-GSPRT and LTS-GSPRT align closely 
with the asymptotic analysis. Notably, LTS-GSPRT only sacrifices a fractional sample-size 
compared to C-GSPRT while yielding substantially lower overhead through low-frequency one- 
bit communication. Figs. 2-3 also clearly show that U-GSPRT diverges from C-GSPRT and 
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Fig. 4. Expected sample size versus varying parameter values. 


LTS-GSPRT by an order of magnitude due to the smaller value of the KL divergenee (i.e., 
D{fe\\ho) = 0.08 > /lO D {pl\\pl) Ri 0.051 and mie D (hoWfe) = 0.08 > 

infe D {pl\\pl) ~ 0.050 > inf^ D |pe°) /lO 0.042). Note that we also plot the performanee 
of U-GSPRT for Tq = 1 that eorresponds to a binary quantization at every instant. It is seen 
that even with ten times more frequent eommunieation to the fusion eenter, U-GSPRT is still 
outperformed by LTS-GSPRT substantially. 

Fig. 4 illustrates the performances of C-GSPRT, U-GSPRT, LTS-GSPRT for varying parameter 
values. Note that all algorithms are implemented without this knowledge, hence this figure 
shows how they adapt to different parameter values, which is a critical performance indicator 
for composite test. The error probabilities are fixed at a ^ 2x 10“"^, /3 ~ 10“'^. As 9 varies from 
0.4 to 2, the fusion center samples faster from the sensors, i.e., Ee(To) ~ 10 —)■ 1.5, due to the 
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Number of sensors: L 


Fig. 5. Expected sample size versus number of sensors. 


embedded adaptive meehanism. Meanwhile, U-GSPRT with the best time resolution Tq = 1 is 
examined. It is elearly shown in Fig. 4 that LTS-GSPRT is able to align with C-GSPRT elosely 
and eonsistently outperforms U-GSPRT over all parameter values. Again, LTS-GSPRT results 
in the lowest eommunieation overhead among these three tests. 

Fig. 5 further examines the eentralized and deeentralized algorithms under different number of 
sensors. The error probabilities are fixed at a 2 x 10“'^, /3 ~ 10“^. Clearly, using more sensors 
brings down the sample size given a target aeeuraey. It is seen that, for a reasonable number of 
sensors in praetiee, e.g., eight sensors, LTS-GSPRT stays elose to the eentralized seheme and 
eonsistently exhibits smaller sample size eompared to the uniform sampling based deeentralized 
seheme. 
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B. Collaborative Sequential Spectrum Sensing 

In this subsection, we consider the collaborative sequential spectrum sensing in cognitive 
radio systems. To cope with the ever-growing number of mobile devices and the scarce spectrum 
resource, the emerging cognitive radio systems enable the secondary users to quickly identify the 
idle frequency band for opportunistic communications. Moreover, secondary users can collaborate 
to increase their spectrum sensing speed. Specifically, if the target frequency band is occupied 
by a primary user, the received signal by the £th secondary user can be written as 

yl = h^st + e[ (54) 


where h[ ~ AA(0,1) is the normalized fading channel gain between the primary user and the 
£th secondary user, independent of the noise ef and St is the unknown signal transmitted by the 
primary user with energy E|stp; otherwise if the target frequency band is available, secondary 
users only receive noise. To this end, the sequential spectrum sensing can be modelled as the 
following composite hypothesis testing problem [31]: 


Uq: i/f ~ AC ( 0 , 7 ) , 0 < 7 o< 7 < 7 i, £ G £, t = 1 , 2 ,..., 

Hi: 7i<^o<^<^i f e £, t = 1,2,..., 


where the parameter intervals [ 70 , 71 ] and [ 6 ^ 0 , ^i] are prescribed by practitioners. 
We begin by verifying that the log-likelihood ratio of yl, i.e.. 




1 f\yl? \y. 


7 


£12 

t \ 


9 


, 1 , 7 

+ 2 log 


0 ’ 


(56) 


satisfies the conditions Al-AA. While conditions ^42-^43 are easily verified, conditions A1 and 
A4: can be checked as follows: 

• The KL divergences admit 


D{fe\\h,) 
and D{h^\\fe) 


1/7 .4 1 1 


which are both decreasing functions of 7 and increasing functions of 6 . Let 0 < e < 

mm{D{h^^\\fe^) ,D{h^^\\feo)}, we have D {fg\\h^) > T)(/eJ|/i^J > £ and 

mief,<e<ei Dih^Wfe) > D{h^^\\feo) > e; 
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• For (9), let X > ^ > 0, then we have 



(57) 



\/^6*o 


(58) 


where the inequality holds beeause > 0, 6* > 0 and \{{yiY} — 0\ < max{(|/f)^, 9}, and 
(57) holds beeause x > Again, sinee $(—\/^6*o/i/7) ~ we ean always find a 


suffieiently large Xq sueh that x > | logx|^ , or equivalently, (sup0jj<0<0^ |V0^(0,7)| > x) < 
g-|ioga:|’) X > xo,ri > 1. Similarly, we ean show that (10) holds as well. 

With A1-A4 satisfied, we proeeed to employ the eentralized and LTS-based GSPRTs to solve 
the eollaborative sequential speetrum sensing problem, whieh ean be eharaeterized asymptotieally 
by Proposition 1 and Theorems 1-2. Partieularly, the eentralized LLR at the fusion eenter is 
evaluated as 



T U 



(59) 


As sueh, the eentralized MLE of the unknown parameters 7 and 6 are easily obtained as 7^ 


£ {Wj/ {k — j + 1) /L,7 o, 7 i) and 9^ = £ {Wj / {k — j + 1) /L,9o,9i). Then the eentralized 


GSPRT given by (6)-(7) ean be implemented based on the GLLR Sj = Sj 
implement LTS-based GSPRT, the loeal LLR at sensor I is 


(7^, In order to 



k 


(60) 
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Substituting 7*'’^ = E (fc - j + 1 ), 70,71) and 6 ^ = £ {k - j + 1 ), 6>o, 6>i j into 

(60) gives local GLLR which is further plugged into (20)-(21) to run the LTS- 

GSPRT Tp. To realize U-GSPRT for this problem, given the inter-communication period Tq, the 
sufficient statistic is found to be 0^’^ = Wj’^, which is defined in (60), with different distributions 
under the null and alternative hypotheses: 

(0), (0) . (61) 
Therefore, the binary quantizer for this problem is written as 

qi = sign - a) , (62) 

whose distribution is 

= 1 - ^To , a: e [ 70 , 7 i] U [0o, ^ 1 ], (63) 

where (x) is the CDF of the chi-squared distribution with degree of freedom k. By solving the 
maximum likelihood problem, it is straightforward to find the estimates of 7 and 9 respectively 
as 

Note that the log likelihood ratio of is the same as (51) with p^°{X) replaced by (63). 
Therefore, conditions A 1 and AA are verified by noting that 

• Pe°W for all 9 e [6»o,6'i] and 7 e [70,71], given any A; 

• supgjj<g<gj ^%~/p1)° ®^P 0 o< 6 »< 6 »i ^|“/f ~ Pe° bounded, thus the same argument as in 

(52) applies. This is seen by recalling the density function of the chi-squared distribution, 

M! = Ar' ^ 

dx x^^^°\x)-\x) 2^o/2r(To/2)a:’ 

with X residing in a compact set, i.e., X e [70,71] u [00,6^1]. 

Then we can also asymptotically characterize the performance of U-GSPRT in the sequential 
spectrum sensing problem by (16)-(18). 

In the simulation experiment, the parameter intervals of interest are set as 7 G [0.2,1] and 
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6 E [2,5]. We consider U-GSPRT with the best time resolution Tq = 1, where the minimax 
quantizer A = aigmaxmme D (fgWh^) Ri 3.8. The expected inter-communication period for 
LTS-GSPRT is again set approximately as ETq ~ 10. 

In Fig. 6-7, the performances of two-user C-GSPRT, U-GSPRT and LTS-GSPRT are examined 
with 7 = 1, 6* = 2 in terms of the expected sample size (i.e., spectrum sensing speed) as a function 
of the false alarm probability and miss detection probability respectively (with (3 ~ 10“^ in Fig. 
6 and a ~ 10“^ in Fig. 7). In both figures, the asymptotic optimality of LTS-GSPRT is clearly 
demonstrated as it aligns closely with C-GSPRT. In contrast, U-GSPRT diverges significantly 
from C-GSPRT and LTS-GSPRT due to the smaller values of the KL divergence. Furthermore, 
Fig. 8 compares the three sequential schemes for different parameter values and Fig. 9 further 
depicts their performances with different number of collaborative secondary users with the error 
probabilities a ~ 10“'^. Note that, although U-GSPRT sends local statistics to the fusion 
center every sampling instant, it is consistently outperformed by LTS-GSPRT where each user 
transmits the one-bit message only every ten sampling instants on average. More importantly, 
LTS-GSRPT only compromises a small amount of expected sample size compared to the C- 
GSPRT while substantially lowering the communication overhead. In cognitive radio systems, 
such an advantage brought by LTS-GSPRT allows the secondary users to identify available 
spectrum resource in a fast and economical fashion. 

VI. Conclusions 

This work has investigated the sequential composite hypothesis test based on data samples from 
multiple sensors. We have first introduced the GSPRT as an asymptotically optimal centralized 
scheme that serves as a benchmark for all decentralized schemes. Next a decentralized sequential 
test based on conventional uniform sampling and one-bit quantization has been studied, which is 
shown to be strictly suboptimal due to the loss of time resolution and coarse quantization. Then, 
by employing the level-triggered sampling, we have proposed a novel decentralized sequential 
scheme, where sensors repeatedly run local GSPRT and report their decisions to the fusion center 
asynchronously, and an approximate GSPRT based on the local decisions is performed at the 
fusion center. The LTS-based GSPRT significantly lowers the communication overhead through 
low-frequency one-bit communication, and is easily implemented both at sensors and the fusion 
center. Most importantly, we have shown that the proposed LTS-based decentralized scheme 
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Fig. 6. Spectrum sensing speed versus false alarm probability a. 

achieves the asymptotical optimality as the loeal thresholds and the global thresholds grow large 
at different rates. Finally, extensive numerieal results have eorroborated the theoretieal results 
and demonstrated the superior performanee of the proposed method. 

Appendix 

A. Proof of Theorem 1 

We first introduee the following result as an extension to the Wald’s identity, that ean be found 
in [19, Lemma 3]. 

Lemma 1. Let be defined by (20). Consider a sequence {fn} of Ltd. random variables 
where each is a function of the samples i/S ,,, • • •, ylt acquired by sensor i during its nth 
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Fig. 1. Spectrum sensing speed versus miss detection probability p. 


inter-communication period. Then the following equality holds: 

/7Vf+l \ 

E. ( S = E40 + 1) , xGTue. (65) 


Here, (65) differs from the standard Wald’s identity because Nj + 1 is no longer a stopping 
time adapted to {fi}. Next we proceed to analyse the expected sample size under level-triggered 
sampling. Since the proof is concentrated on the LTS-based decentralized scheme only, we use 
T for Jp (cf. (24)) for notational simplicity. 
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e 



Fig. 8. Spectrum sensing speed versus different parameter values with and without the primary user. 
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Number of users: L 


Fig. 9. Spectrum sensing speed versus different number of collaborating secondary users. 


Proof of Theorem 1: Note that the global statistic in (23) can be rewritten as 

L Nf 
l=\ n=l 

L (Ni+1 \ 

£=1 y n=l J 

L N^+l L 

£=1 n=l £=1 


( 66 ) 
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Thus, invoking Lemma 1, we have 


E. (Lt) (iV| + l) E, 


xGTU©. 


(67) 


i=i 


i=i 


Denote the inter-eommunieation period t^n~ ^n-i- Further define = J2n=i^ T? — T > 0, 
by noting that T < J2n=i^ ^ result, we ean write down the following equality for eaeh 

sensor: 


Eo 


/7Vf+l \ 

(T + flf)=Ei( =E.(]V' 


+ 1)E44), ( = 


Combining (68) and (67) yields 




£=1 




e=i 


(T) E +E 


^ E. (ri) 


£=1 


Furthermore, aeeording to Proposition 1, we have 


Eg (vi) ~ a ~ Eg (r^ inf D {fe\\h^), as a -)■ 0, 
and E^ (u^) ~ ~ -E^ (r,^) inf D (h^Wfe), as (3^0. 


e 

Considering the sensor samples under hypothesis "Hi, (69) beeomes 

L L 


K 


whieh leads to 


Efl 


Ee (T) = 


t'd+EtiK' A + Y.l,n^ 


=1 '4 


D {feWh^) L m.i^D{fe\\h^)L 


, as Oi —^ 0, /3 —^ 0. 


( 68 ) 


(69) 


(70) 

(71) 


Ee (Lt) = E, (T) mi D if 0 \\h,) - (e^ - E,(i?,) ini D {fe\\h,) ) , (72) 

£=1 £=1 A_Z_E 


(73) 
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Similarly, substituting (71) into (69) gives 


E, (VV) = -E, (T) y] inf D (ft,| I/s) - (e, («„,,.,) + E,(B,) inf D (ft,||/„)), (74) 

e=i (.=1'' -^ 


ni^ 


and the expeeted sample size under the null hypothesis is 


E, (T) = 




infsD(ft-,||/e)L infsZ) (A,||/s) L 


3.S Oi —y 0, /3 —y 0. 


(75) 


We also have Ee ^ ^ as A, B ^ oo and a = o (A) ,b = o (B). Note 

that TZq and TZi^ only depend on loeal thresholds {b, a}, whieh are of a lower order of {B, A}; 
therefore, we have proved the asymptotie formulas (32) and (33). ■ 


B. Proof of Theorem 2 

The proof eonsiders the asymptotie regime where A/a —)■ oo, B/b —)■ oo and limsup a/b < oo. 
Again, T is used for Tp for notational simplieity. 

Proof: For simplieity of notations, we assume L = 2 in the proof. When L > 2, the proof 
is similar and is thus omitted. Thanks to the symmetry of type I and type II error probabilities, it 
is suffieient to eompute the type I error probability. For any 7 G F, we eonsider the probability 

> A). (76) 


We first define the loeal diseretized approximated generalized log-likelihood ratio proeess, 

N‘ 

^ - bl^ui=-i}, ^ = 1, 2,..., L. 


n=l 

Then (76) has the following upper bound 


P 7 (W > 21) < P.^(sup Vt> A) <¥J sup + sup > a) . (77) 

t ft i t 2 

The first inequality is due to the definition of T, and the seeond inequality is because sup Vt < 
X]^=iSupj We proceed to split the last probability in (77) into error probabilities detected 
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by the local sensors. Let e be an arbitrary positive constant, then 

f sup + sup > a) 

V t t ' 

Li/eJ 

< ^ (^keA < sup < {k + l)eA, sup > (1 — (^ — 

k=l ^ * 

+P.y ( sup < eA, sup > (1 — £)a') . 

V t t ' 


Note that the stochastic processes : t > 0} and : t > 0} are independent and 

identically distributed, so the right-hand side of the above inequality equals to 

Li/eJ 

¥^{^8A < sup < {k + l)e74jp.^^sup > (1 — (/s — 

+P.y(sup < eA)¥^ ( sup > (1 — ; 

t V t ^ 


which can be further bounded above by 

Ll/eJ 



For each k such that 1 < k < L^J, we have e < ke < 1 and (1 — (fc — l)e) = 1 — ke + s. 
Consequently, (78) can be further bounded above by 


e ^ sup P.y ( sup ( sup > (1 — P + £)a') . (79) 

pe[£,i] ^ t / V t / 

Then we use the following lemma whose proof is given below to complete the proof of Theorem 

2 . 


Lemma 2. For e > 0 and p> e, 

P^(^supL/^^ > p2l) < as A ^oo. 

The above limit is uniform with respect to p and 7. 


Applying Lemma 2 to (79) gives the result in Theorem 2. ■ 
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Proof of Lemma 2: To start with, we write in terms of the sum of i.i.d. variables, 


Nt 




{< = !} - 


72=1 


Therefore, the event {sup^ 1//^^ > pA] is the same as the event 


N 

{sup^y^ - 


72=1 


where 


Yn fo]L|^l=_ij, TL 1, 2, .. 


The above event is further equivalent with the event 


{N* < cx>}, 


where N* = inf{iV : Y^ > pA}. Therefore, 


¥J supy/^^ >pA] = ¥JN* < oo). 


We apply a ehange of measure to provide an upper bound to the above expression. Let P and 
Q be probability measures under whieh Yn,n = 1, 2,... are i.i.d. random variables and 


P(y„ = a) = p and PC^n = —h) = 1 — p, 


and 

Q(yn = a) = q and P(y„ = -b) = I - q, 


where p = (e“ — e ^(1 — e and g = (e“ — e ^e“(l — e ^). With a ehange of measure, 
we have 


F^{N* <oo) = E^ 


'APn* dPvT* 

-^—; N < oo 


(80) 


where and denote the likelihood ratios between P.^ and P, and between P and Q at 

cflPjv* ciQjv* ^ 

the stopping time N* respeetively. It is easy to eheek that 


dP jv* 
dQ/v"* 


= exp 


ityf 

n=l 
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Because N* < oo implies Y.n=i > pA, the probability in (80) has an upper bound 


dF 


N* 




;N* <oo 


N* 




-;N* <oo 


can be written as the sum 


N- < 4 

dF]\i* h 


h 


k=K-\-l 


^;4 <n- <(k+i)A 

dF^* h a 


(81) 


It is sufficient to show that h +12 can be bounded by as A —)■ cx) for some constant k that 
is sufficiently large. We provide upper bounds for Ji and I 2 separately. We start with an upper 
bound for Ji. Notice that under P..^, Y^, n = 1, 2,... are i.i.d. random variables and 

F^{Yn = a) = Aj and P^(F„ = —b) = 1 — 


where is defined in (28); then 


dFr, 


s #{i:Yi=a, and i<n} /. ~ x j^{v.Yi=-h, and i<n} 

) ( _ _ ^ ) < gO(a)n_ 


(82) 


dFr, \V ) V 1 - P 

The second inequality is due to < e“0+o(i))a according Proposition 1, and p = e““(l + o(l)) 
as a, 6 —)■ 00 . Consequently, 

Ji < < e°(^). (83) 


We proceed to an upper bound of l 2 - According to (82), we have 


h<^ sup ^ 


'Fi 




Y,Y,<a\. (84) 


k=K+l 


2=1 


^ l<n<A:— ,_i 
— — a * — 1 

The event | implies that : Yi = —b, and i < n} > Therefore, 




£ r, < A ] < q(#|7 : Y, = -b, and 7 < \_k^\] > 


i=l 
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Standard result on tail bound for binomial distribution (see, for example, [32]) yields 

An {k-l)A' 


: Yi = -b, and i < [k—\} > -^-) 


< exp 


\ik-l)A/bf 


< exp ( — ek—Y') < e 


—ekA 


(85) 


for some positive eonstant e that is independent of A and a. Combining (84) and (85), we have 

CX) 

12 < Y ^{k+l)o{A) ^-skA ^ 
k=K 


We complete the proof by combining the upper bounds for Ji and I 2 . 
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